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Abstract 

Sex-biased gene expression (i.e., the differential expression of genes between males and females) is common among sexually 
reproducing species. However, genes often differ in their sex-bias classification or degree of sex bias between species. There 
is also an unequal distribution of sex-biased genes (especially male-biased genes) between the X chromosome and the 
autosomes. We used whole-genome expression data and evolutionary rate estimates for two different Drosophilid lineages, 
melanogaster and obscura, spanning an evolutionary time scale of around 50 Myr to investigate the influence of sex-biased 
gene expression and chromosomal location on the rate of molecular evolution. In both lineages, the rate of protein evolution 
correlated positively with the male/female expression ratio. Genes with highly male-biased expression, genes expressed 
specifically in male reproductive tissues, and genes with conserved male-biased expression over long evolutionary time scales 
showed the fastest rates of evolution. An analysis of sex-biased gene evolution in both lineages revealed evidence for a "fast- 
X" effect in which the rate of evolution was greater for X-linked than for autosomal genes. This pattern was particularly 
pronounced for male-biased genes. Genes located on the obscura "neo-X" chromosome, which originated from a recent X- 
autosome fusion, showed rates of evolution that were intermediate between genes located on the ancestral X-chromosome 
and the autosomes. This suggests that the shift to X-linkage led to an increase in the rate of molecular evolution. 
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Introduction 

Sexual dimorphism, that is, the physical differentiation of 
males and females of a species, is widespread across the an- 
imal kingdom. Most of these differences can be attributed 
to the evolution of differential gene expression in the two 
sexes (reviewed by Ellegren and Parsch 2007). Genes that 
differ in expression level between males and females are 
commonly referred to as sex-biased genes and can be fur- 
ther divided into male- and female-biased genes, depending 
on which sex shows higher expression, while genes with 
similar expression levels in the two sexes are referred to 
as unbiased. Previous population genetic and comparative 
genomic studies of Drosophila melanogaster and its sister 
species D. simulans revealed that male-biased genes have 
increased levels of amino acid divergence between species 
(Zhang et al. 2004; Gnad and Parsch 2006) and particularly 
high rates of adaptive evolution (Proschel et al. 2006; 



Sawyer et al. 2007; Baines et al. 2008). For species outside 
the well-studied me/anogaster subgroup, the situation is less 
clear. Initial studies of D. ananassae and D. pseudoobscura 
that used a limited number of genes could not confirm the 
pattern of faster evolution for male-biased genes (Metta 
et al. 2006; Grath et al. 2009). However, a whole-genome 
study reported accelerated rates of protein evolution for 
male-biased genes between two closely related species in 
the obscura group (Jiang and Machado 2009). 

Many factors contribute to variation in rates of evolution 
between different proteins in Drosophila by either influenc- 
ing the rate of evolution itself or imposing evolutionary 
constraints. Larracuente et al. (2008) identified expression 
level, intron and protein length, intron number, number 
of protein-protein interactions, recombination rate, and 
translational selection as possible affectors. In one recent 
study, protein secondary structure was found to influence 
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rates of positive selection in Drosophila (Ridout et al. 2010). 
These authors found that amino acids forming disordered 
regions, for example, random coils, are more likely to expe- 
rience positive selection than amino acids situated in helices 
and p-structures. For D. melanogaster and Mus musculus, it 
has been shown that sex-biased genes showing tissue- 
specific gene expression in reproductive tissues, and thus 
having a narrow expression profile within the organism, 
show faster rates of evolution than unbiased genes with 
tissue-specific expression or those with broader expression 
profiles (Meisel 2011). 

The chromosomal location of genes (i.e., whether they 
are sex-linked or autosomal) may also influence their rate 
of evolution. Theory predicts that X-linked genes (or Z-linked 
genes in female heterogametic taxa) should exhibit 
a "fast-X" effect if adaptation occurs primarily through 
new beneficial mutations that are, on average, recessive 
(Charlesworth et al. 1 987). This is because recessive X-linked 
mutations are immediately subject to selection in hemizy- 
gous males. However, a fast-X effect is not expected when 
adaptation occurs from standing genetic variation (Orr and 
Betancourt 2001), as may be the case when there is rapid 
environmental change (Karasov et al. 201 0). In addition, fac- 
tors such as the overall effective population size and the rel- 
ative effective population sizes of males and females, may 
also affect rates of evolution of the X chromosome and au- 
tosomes differently (Vicoso and Charlesworth 2009). These 
factors may explain why a clear fast-X (or fast-Z) effect has 
been observed in mammals and birds, but not in Drosophila 
(Mank et al. 2009). In Drosophila, the fast-X effect is ex- 
pected to be small (Mank et al. 2009) and is typically not 
observed in whole genome studies that examine ratios of 
nonsynonymous-to-synonymous divergence (e.g., Thornton 
et al. 2006). However, if selection on sex-biased genes oc- 
curs mainly in the sex that shows enriched expression, as 
appears to be the case in Drosophila (Connallon and Clarke 
201 1), then male-biased genes should show the strongest 
fast-X effect because they are most often subject to selec- 
tion in a hemizygous background (Baines et al. 2008). Con- 
sistent with this, male-biased genes have been observed to 
exhibit a large fast-X effect in the D. melanogaster subgroup 
(Baines et al. 2008). 

In this report, we examine the influence of the degree 
and conservation of sex-biased gene expression, as well 
as chromosomal location, on the rate of molecular evolution 
in two independent Drosophilid lineages, melanogaster and 
obscura. These lineages diverged from each other around 
50 Ma (Bergman et al. 2002; Tamura et al. 2004). An inter- 
esting difference between the two lineages is the presence 
of a "neo-X" chromosome in the obscura lineage, which is 
the result of a fusion of an autosome and the X chromosome 
that occurred about 8-1 2 Ma (Tamura et al. 2004; Richards 
et al. 2005; Gurbich and Bachtrog 2008; Bachtrog et al. 
2009). We find that there is a positive correlation between 



the degree of male-biased expression and the rate of protein 
evolution in both lineages. Genes with conserved male- 
biased expression between the lineages show the fastest 
rates of evolution. For both lineages, we observe a fast-X 
effect that is especially strong for male-biased genes. This 
effect is greater for the ancestral X chromosome but is also 
present on the neo-X chromosome of the obscura lineage. 

Materials and Methods 

Data Sets and Species 

For both the melanogaster and the obscura lineage, we ex- 
tracted male/female (M/F) expression ratios and data on the 
rate of protein evolution measured by the ratio of nonsynon- 
ymous to synonymous substitutions (d N /d s ) from the Sebida 
database (v. 2.0; Gnad and Parsch 2006; http://www. 
sebida.de). The melanogaster data originate from a meta- 
analysis of D. melanogaster sex-biased gene expression over 
several studies (Parisi et al. 2003; Ranz et al. 2003; Gibson 
et al. 2004; Parisi et al. 2004; Mclntyre et al. 2006; Ayroles 
et al. 2009), while data for the obscura lineage come from 
a microarray analysis of D. pseudoobscura (Jiang and 
Machado 2009). All M/F expression ratios were log 2 trans- 
formed. We excluded genes for which no FlyBase identifier 
(FBgn) could be associated, as well as genes lacking infor- 
mation on chromosomal location, d N /d Sl or expression state. 
For the melanogaster lineage, d N /d s values come from 
a comparison of D. melanogaster and D. simulans. For 
the obscura lineage, d N /d s values come from a comparison 
of D. pseudoobscura and D. persimilis. In both cases, we ex- 
cluded genes with d N /d 5 > 9, as they tend to be unreliable 
estimates where d 5 is equal to (or very close to) zero. The 
final data set contained 12,419 genes (10,437 autosomal 
and 1,982 X linked) for the melanogaster lineage and 
10,1 18 genes (6,641 autosomal, 1,657 located on the left 
arm of the X chromosome [XL], and 1,820 located on the 
right arm of the X chromosome [XR]) for the obscura line- 
age. Orthologs between D. melanogaster and D. pseu- 
doobscura could be identified for 8,439 genes. In 
addition, we used expression data from D. ananassae 
(Zhang et al. 2007) to infer M/F expression ratios of genes 
of this species. Drosophila ananassae is phylogenetically sit- 
uated at the split of the melanogaster subgroup from the 
melanogaster group {Drosophila 12 Genomes Consortium 
et al. 2007; Larracuente et al. 2008) and was used to infer 
gains or losses of sex-biased expression within the mela- 
nogaster group. Orthologs among all three species could 
be identified for 5,336 genes. 

Analysis of Rates of Evolution with Respect to Degree of 
Sex-Bias 

Correlations between log 2 (M/F) and the rate of protein 
evolution {d N /d s ) were assessed using the nonparametric 
Spearman's rank correlation for each lineage. In addition, 
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correlations were determined separately for male-, female-, 
and unbiased genes. For each lineage, we also ranked all 
genes by their M/F expression ratio and separated them into 
five equally sized groups (highly male biased, weakly male 
biased, unbiased, weakly female biased, and highly female 
biased). Comparisons among groups were performed using 
Kruskal— Wallis tests. Significant differences were further in- 
vestigated using pairwise Mann-Whitney U tests between 
groups with Bonferroni correction. 

Analysis of Rates of Evolution with Respect to Tissue- 
Specific Expression 

We used FlyAtlas (Chintapalli et al. 2007) and the approach 
of Meisel (201 1) to determine the expression breadth of all 
genes in our D. melanogaster data set. We considered 10 
somatic tissues shared by males and females (brain, eye, 
thoracioabdominal ganglion, salivary gland, crop, midgut, 
Malpighian tubule, hindgut, heart, and fat body), two 
male-specific tissues (accessory gland and testis), and two 
female-specific tissues (spermatheca and ovary). A gene 
was considered as expressed in a given tissue if its mean 
microarray signal intensity was >1 00 (Meisel 2011). Since 
tissue-specific expression data were not available for 
D. pseudoobscura, we made the assumption that D. pseu- 
doobscura genes shared the expression pattern of their 
D. melanogaster orthologs. In the end, we were able to 
assign tissue expression patterns to 1 1 ,082 D. melanogaster 
genes and 7,757 D. pseudoobscura orthologs. For compar- 
isons of evolutionary rates, we separated sex-biased genes 
(male and female) into two groups: 1 ) those with expression 
limited to male (or female) reproductive tissues and 2) those 
expressed in one or more nonreproductive tissue. To control 
for the possible accelerated evolution of genes with tissue- 
specific expression (Haerty et al. 2007; Meisel 201 1), we also 
compared the above genes to a set of unbiased genes that 
showed expression in only a single somatic tissue. 

Inferring the Conservation of Sex-Biased Gene Expression 

To examine the conservation of sex-biased gene expression, 
we investigated the M/F ratios of orthologous genes be- 
tween the two lineages, melanogaster and obscura. First, 
we compared the rate of protein evolution of genes with 
conserved sex-biased expression between D. melanogaster 
and D. pseudoobscura to that of genes that differed in their 
sex-bias classification between species. We divided the set 
of genes into nine categories according to their expression 
conservation (see supplementary table 1, Supplementary 
Material online). Second, we analyzed the conservation of 
degree of sex bias between these orthologs for male- 
and female-biased genes. We ranked the sex-biased genes 
according to their degree of sex-bias within each species and 
compared the overlap of genes between the species for the 
top 10% and top 25% of genes. Subsequently, data from 



D. ananassae were used to infer changes in sex-biased gene 
expression along the phylogeny. Each category from the 
above analysis (supplementary table 1, Supplementary Ma- 
terial online) can be subdivided into three groups (male-, 
female-, or unbiased) according to the expression state in 
D. ananassae. Thus, the expression pattern in the three spe- 
cies can be separated into 27 groups (see supplementary 
table 2, Supplementary Material online). A gain or loss of 
sex-biased gene expression along the phylogeny was in- 
ferred using parsimony. First, we compared the groups with 
gene expression states conserved over all three species. Sec- 
ond, we performed comparisons between groups of genes 
with conserved sex-biased gene expression and those that 
differed in sex-bias classification among species. All compar- 
isons were performed using pairwise Mann-Whitney U tests 
between groups with Bonferroni correction and were car- 
ried out on the complete data sets and on autosomal 
and X-linked genes separately. Spearman rank correlations 
were used to assess the correlation between rates of pro- 
tein evolution of orthologs on the two lineages, as well 
as the correlation between M/F expression ratios on the 
two lineages. 

Influence of Chromosomal Location and Gene 
Conservation 

To test for a possible "fast-X" effect, we compared the rates 
of evolution (measured as d N /d 5 ) between autosomal and X- 
linked genes. For the obscura lineage, X-linked genes were 
further divided into genes situated on the XR and genes 
situated on the XL. Genes on XR originate from a recent 
(-8-12 Ma) X-autosome fusion in the obscura clade and 
are generally located on the autosomal arm 3L in D. mela- 
nogaster Differences between groups were determined 
using Kruskal-Wallis tests and subsequent pairwise 
Mann-Whitney U tests with Bonferroni correction. To test 
for lineage-specific differences and the influence of homol- 
ogous gene conservation between lineages, we also per- 
formed the above analyses on the set of orthologous 
genes between the melanogaster and obscura lineages. 
All statistical analyses were performed using R 2.10.1 
(R Development Core Team 2010). 

Results 

Correlation between Degree of Male-Bias and Rate of 
Protein Evolution 

For both the melanogaster and the obscura lineage, there 
was a significant positive correlation between the rate of 
protein evolution (d N /d 5 ) and the ratio of male-to-female 
(M/F) expression (Spearman's rank correlation, melanogast- 
er. p = 0.16, P < 0.001; obscura: p = 0.17, P < 0.001). 
This was true for the complete set of genes, as well as 
the autosomal and X-linked genes considered separately 
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Table 1 

Correlations between d/y/ds and M/F Expression Ratio 



Melanogaster Lineage Obscura Lineage 



Genes 3 


Chromosome 




P C 


pd 


yv b 


P C 


pd 


All 


All 


12,419 


0.16 


<0.001 


10,118 


0.17 


<0.001 




Auto 


10,437 


0.17 


<0.001 


6,641 


0.17 


<0.001 




X (or XL) 


1,982 


0.18 


<0.001 


1,657 


0.25 


<0.001 




XR 






_ 


1,820 


0.17 


<0.001 


M 


All 


3,381 


0.34 


<0.001 


3,295 


0.07 


<0.001 




Auto 


2,350 


0.30 


<0.001 


2,350 


0.07 


<0.001 




X (or XL) 


395 


0.38 


<0.001 


470 


0.15 


<0.001 




XR 








475 


0.07 


0.11 


F 


All 


4,983 


-0.01 


0.31 


4,761 


0.07 


<0.001 




Auto 


4,053 


-0.01 


0.52 


2,954 


0.07 


<0.001 




X (or XL) 


929 


-0.03 


0.42 


840 


0.06 


0.09 




XR 








967 


0.08 


0.01 


U 


All 


4,055 


-0.01 


0.50 


2,062 


0.01 


0.52 




Auto 


3,397 


-0.01 


0.79 


1,337 


0.02 


0.55 




X (or XL) 


658 


-0.03 


0.39 


347 


0.01 


0.81 




XR 








378 


0.02 


0.75 



a "M" indicates male-biased genes, "F" indicates female-biased genes, and "U" 
indicates unbiased genes. 
b Number of genes. 
c Spearman's p. 

d As determined by Spearman's rank correlation. 



(melanogaster autosomes: p = 0.17, P < 0.001; obscura 
autosomes: p = 0.17, P < 0.001; melanogaster X: p = 
0. 1 8, P < 0.001 ; obscura X: p = 0.25, P = 0.03). Particularly 
for the melanogaster lineage, the above correlations were 
much stronger for male-biased genes than for female- 
and unbiased genes (table 1). 

When the genes were placed into categories according to 
their type and degree of sex-biased expression, a general 
pattern of faster evolution for highly male-biased genes 
was evident (fig. 1). This pattern held for both autosomal 
and X-linked genes (supplementary table 3, Supplementary 
Material online). 

Previous studies have shown that the vast majority of sex- 
biased genes differ in expression between male and female 
reproductive tissues (Parisi et al. 2003) and that reproductive 
tissue-expressed genes show an accelerated rate of evolu- 
tion relative to sex-biased genes expressed in nonproduc- 
tive tissues (Meisel 201 1). We could confirm this pattern in 
our data set (fig. 2). Although tissue-specific genes are 
known to evolve faster than those with broad expression 
patterns (Haerty et al. 2007; Meisel 201 1 ), this cannot com- 
pletely explain our observations, as genes expressed in male 
or female reproductive tissues had significantly faster rates 
of evolution than unbiased genes that were expressed in 
a single somatic tissue (fig. 2A). Because tissue-specific ex- 
pression has not been systematically analyzed in D. pseu- 
doobscura, we could not directly test for its effects on 
evolutionary rate. However, under the assumption that D. 
pseudoobscura genes share the expression pattern of their 
D. melanogaster orthologs, we find that the above results 



also hold for the obscura lineage (fig. 2B). Note that this 
approach may exclude a disproportionate number of male- 
biased or male reproductive tissue-expressed genes, as they 
tend to show high turnover between species (Zhang et al. 
2004; Levine et al. 2006; Baines et al. 2008). Despite this lim- 
itation, we see remarkable similarity between the two line- 
ages, especially for male reproductive genes (fig. 2). 

We also compared d N /d 5 of the X-linked and autosomal 
genes within each expression class. In all cases, X-linked 
genes had faster rates of protein evolution than autosomal 
genes (fig. 3). For male-biased genes, the difference was sig- 
nificant in both lineages (table 2). For the obscura lineage, 
genes located on the "neo-X" chromosome (XL) consistently 
showed faster rates of evolution than genes located on XR 
(fig. 3 and table 2). 

When both sex- and tissue-specific expression patterns 
were considered, there was a highly significant fast-X effect 
for male-biased reproductive tissue-specific genes in the 
melanogaster lineage (fig. 4 and table 3). A weaker fast- 
X effect was observed for male-biased genes with expres- 
sion outside of reproductive tissues, but was not significant 
after correcting for multiple tests (table 3). There was no ev- 
idence for faster evolution of X-linked genes with expression 
limited to female reproductive tissues. In the obscura line- 
age, male-biased genes located on chromosome arm XL 
showed a highly significant fast-X effect, regardless of 
whether their expression was limited to reproductive tissues 
or not (fig. 4 and table 3). Female-biased genes expressed 
outside of reproductive tissues located on XL showed signif- 
icantly faster evolution than autosomal genes (table 3). 

Influence of Gain/Loss and Conservation of Sex-Biased 
Expression 

For the set of orthologous genes between D. melanogaster 
and D. obscura, we determined the conservation of sex- 
biased gene expression and separated the genes into nine 
groups (supplementary table 1 , Supplementary Material on- 
line). There were significant differences in the rate of protein 
evolution among the nine groups (Kruskal-Wallis test, P < 
0.001). We also performed pairwise comparisons of groups 
using Mann-Whitney U tests with Bonferroni correction. 
Genes with conserved male-biased expression had signifi- 
cantly higher d N /d 5 values than genes of all other groups, 
whereas there was typically no significant difference be- 
tween genes with male-biased expression in only one spe- 
cies (supplementary table 4, Supplementary Material 
online). Furthermore, conserved female-biased genes did 
not show differences in the rate of evolution compared with 
genes with female-biased gene expression in only one spe- 
cies (supplementary table 4, Supplementary Material on- 
line). There was a significant positive correlation between 
the M/F ratios for the orthologous genes of the two lineages 
(Spearman's p = 0.76, P < 0.001; fig. 5A). However, we 
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Highly 
male-biased 

6.4 >M/F> 0.50 



Weakly 
male-biased 

0.50 > M/F > 0.07 



unbiased 



Weakly 
female-biased 



Highly 
female-biased 



0.07 > M/F > -0.22 -0.22 > M/F > -0.72 -0.72 > M/F > -4.93 




Highly 
male-biased 

4.92 > M/F > 0.31 



Weakly 
male-biased 

0.31 > M/F > 0.03 



unbiased 



Weakly 
female-biased 



Highly 
female-biased 



0.03 > M/F >-0.16 -0.16 > M/F >-0.42 -0.42 > M/F > -4.96 



Fig. 1. — Evolutionary rates of male-, female-, and unbiased genes. Genes were ranked according to M/F expression ratios and divided into five 
equally sized groups for each lineage. The M/F cutoffs (on a log 2 scale) used to create these groups are given in parentheses. Panel (A) shows 
comparisons for the melanogaster lineage. Each group contains 2,483 or 2,484 genes. Panel (B) shows comparisons for the obscura lineage. Each group 
contains 2,023 or 2,024 genes. The heavy horizontal line in each box indicates the median, with notches at the side indicating its 95% confidence 
interval. The edges of each box represent the bounds of the upper and lower quartiles, that is, the box shows the interquartile range. The dotted lines 
("whiskers") on either side of the box indicate the adjacent values. The upper (lower) adjacent value is the value of the largest (smallest) observation that 
is less (greater) than or equal to the upper (lower) quartile plus 1.5 times the length of the interquartile range. For both lineages, highly male-biased 
genes had significantly higher d N /d s than genes of all other categories (Mann-Whitney U test, P < 0.001 in all cases). Significance levels for all 
comparisons are given in supplementary table 3 (Supplementary Material online). 



observed only moderate conservation of the degree of sex 
bias for both male- and female-biased genes. For each lin- 
eage, genes with sex-biased expression (1,681 male-biased 
genes and 3,390 female-biased genes) were ranked accord- 
ing to their degree of sex bias and the overlap between the 
top 10% and top 25% of genes in each species was deter- 
mined. The proportion of genes that were in the top 10% 
for the melanogaster lineage and also in the top 1 0% for the 
obscura lineage was 41 % for male-biased genes and 45% 
for female-biased genes. For the top 25% of genes, the 
overlap was 54% for the male-biased genes and 50% for 
the female-biased genes. 

In addition, we investigated whether or not the rate of 
evolution of sex-biased genes was affected by gain or loss 
of sex-biased gene expression along the Drosophilid phylog- 
eny. Estimates of sex-biased gene expression in D. ananas- 
sae were used to infer gains or losses of sex-biased 
expression within the melanogaster group. This allowed 
the genes to be separated into 27 groups (supplementary 



table 2, Supplementary Material online) according to sex- 
bias conservation. There were significant differences among 
the groups in both lineages (Kruskal-Wallis test, P < 0.001 ). 
Consistent with previous studies (Metta et al. 2006; Baines 
et al. 2008), genes with conserved male-biased expression 
showed significantly higher rates of evolution than female- 
biased or unbiased genes (fig. 6A and D). However, some 
groups only consisted of very few genes and showed high 
variation in d N /d 5 . 

Because some of the above groups contained few genes, 
we focused on the comparison of genes that recently gained 
sex-biased expression to those that recently lost sex-biased 
expression. For both male- and female-biased genes, we 
compared genes that gained sex-biased expression in either 
D. melanogaster or D. ananassae (UMU, MUU and UFU, 
FUU) with genes that lost sex-biased expression in one of 
the two species (UMM, MUM and UFF, FUF). For male-biased 
genes, genes that recently lost male-biased expression 
showed higher rates of evolution in both lineages 
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A 1.0- 

^ 08 " 
f 0.6 H 

CD 

E 

^ 0.4 H 

z: 

0.2- 
0.0- 

B i.o- 

0.8- 



i 0.6- 

CD 



0.2 
0.0 -I 





M-Rep 
(1373) 



M-Other 
(1696) 



U-TissueSpec 
(325) 



F-Rep 
(614) 



F-Other 

(3886) 




M-Rep 
(831) 



M-Other 
(1472) 



U-TissueSpec 
(128) 



F-Rep 
(518) 



F-Other 

(3396) 



Fig. 2. — Evolutionary rates of reproductive and nonreproductive tissue-specific genes. Genes expressed specifically in male or female reproductive 
tissues are labeled as M-Rep and F-Rep, respectively. Genes with male- or female-biased expression that is not limited to reproductive tissues are labeled 
M-Other and F-Other, respectively. Genes with unbiased expression that is limited to a single somatic tissue are labeled U-TissueSpec. Panel (A) displays 
results for the melanogaster lineage. Panel (B) displays results for the obscura lineage. Genes with expression limited to male-reproductive tissues show 
significantly faster rates of evolution than all other groups (Mann-Whitney U test, P < 0.001 in all cases). Genes with expression limited to female- 
reproductive tissues show significantly faster rates of evolution than other female-biased genes for both lineages (P < 0.001 ). Furthermore, these genes 
show significantly faster rates of evolution than unbiased genes with tissue-specific expression for the melanogaster lineage {P = 0.009). For the 
obscura lineage, the same tendency can be observed but is not significant (P = 0.30). 



(Mann-Whitney U test, P < 0.001 for melanogaster; 
P = 0.06 for obscura, fig. 6B and £). However, there 
are no significant differences between the two groups for 
genes with female-biased expression (Mann-Whitney U 
test, P = 0.34 for melanogaster; P = 0.30 for obscura, 
fig. 6C and F). 

The Influence of Sex-Linkage on the Rate of Protein 
Evolution 

Of the 8,439 orthologous genes shared between the mel- 
anogaster and the obscura lineages, 5,509 were autosomal 
in both lineages and 1 ,227 genes were X-linked in both lin- 
eages (i.e., located on the X chromosome in melanogaster 
and on XL in obscura). A total of 1 ,527 genes were located 
on the "neo-X" chromosome in D. pseudoobscura (XR) and 
on the homologous autosomal arm 3L in D. melanogaster In 
addition to these, 88 genes were autosomal in D. mela- 
nogaster and located on XL in D. pseudoobscura, 56 genes 
were X-linked in D. melanogaster and autosomal in D. pseu- 
doobscura, 1 1 genes were X-linked in D. melanogaster and 
located on XR in D. pseudoobscura, and 21 genes were 



autosomal in D. melanogaster (but not on 3L) and located 
on XR in D. pseudoobscura. 

We compared rates of evolution among three groups of 
orthologous genes: 1) genes that were autosomal in both 
lineages (auto-auto), 2) genes that were X-linked in both lin- 
eages (X-XL), and 3) genes that were located on 3L in mel- 
anogaster and on XR in obscura (3L-XR). Kruskal-Wallis tests 
revealed significant differences among these three groups in 
both lineages (P< 0.001 in both cases). For the melanogast- 
er lineage, X-linked genes showed significantly higher d N /d 5 
values than 3L-XR genes (Mann-Whitney U test, P < 0.001 ). 
After Bonferroni correction, there were no significant differ- 
ences between either autosomal and X-linked or autosomal 
and 3L-XR genes (Mann-Whitney U test, P = 0.02 in both 
cases, fig. 7A). However, when we pooled autosomal and 
3L-XR genes into one group, the X-linked genes again 
had significantly faster rates of protein evolution (Mann- 
Whitney test, P < 0.001). For the obscura lineage, autoso- 
mal genes showed the slowest rates of evolution, followed 
by genes located on 3L-XR and X-linked genes (fig. IB). The 
differences between groups were significant for the two 
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Fig. 3. — Comparison of autosomal and X-linked genes. Panel (A) displays results for the melanogaster lineage. Panel (B) displays results for the 
obscura lineage. Numbers of genes per group are given in parentheses. "M" indicates male-biased genes, "F" indicates female-biased genes, and "U" 
indicates unbiased genes. Significance levels for relevant comparisons are given in table 2. 



comparisons involving autosomal genes (Mann-Whitney U 
test, auto-auto vs. 3L-XR: P = 0.0013; auto-auto vs. 
X-linked: P < 0.001 ). When we considered genes that were 
autosomal in the melanogaster lineage and located on the 
"neo-X" in the obscura lineage (3L-XR), rates of molecular 

Table 2 

Significance Levels for Comparisons of d/y/ds between Autosomal and 
X-Linked Genes 



Lineage 


Expression 3 


Comparison 




Melanogaster 


M 


Auto versus X 


<0.001* 




F 


Auto versus X 


0.1741 




U 


Auto versus X 


0.0245 


Obscura 


M 


Auto versus XL 


<0.001* 






Auto versus XR 


0.0047* 






XL versus XR 


<0.001* 




U 


Auto versus XL 


<0.001* 






Auto versus XR 


0.0013* 






XL versus XR 


0.1812 




F 


Auto versus XL 


<0.001* 






Auto versus XR 


0.0052* 






XL versus XR 


0.0077 



a "M" indicates male-biased genes, "F" indicates female-biased genes, and "U" 
indicates unbiased genes. 

b As determined by Mann-Whitney U tests. Values marked with asterisks are 
significant after Bonferroni correction. 



evolution were significantly higher for the obscura lineage, 
again indicating faster-X evolution (Mann-Whitney U test, 
P < 0.001). In accordance with previous findings (Sturgill 
et al. 2007; Jiang and Machado 2009), there is a general 
paucity of male-biased genes on XR relative to the auto- 
somes (x 2 test, P < 0.001). 

Comparison of Orthologous Genes 

Because our previous assessment of the correlation between 
d/v/d s and the M/F expression ratio treated each lineage in- 
dependently, the number and identity of genes were not the 
same in the two lineages. This is because some genes did not 
have clear one-to-one orthologs between lineages or ex- 
pression data were lacking for some genes in one of the lin- 
eages. In order to utilize a common set of genes in the two 
lineages, we repeated the analysis using the set of ortholo- 
gous genes for which expression data were available for 
both lineages. Overall, there was a positive correlation be- 
tween the M/F ratios of orthologs in the two lineages (Spear- 
man's p = 0.76, P < 0.001; fig. SA). There was also 
a positive correlation between the rates of protein evolution 
of orthologous genes in the two lineages (Spearman's p = 
0.37, P < 0.001 ; fig. SB), which was robust to different d 5 
and d/v/d 5 cutoffs (supplementary table 5, Supplementary 
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Fig. 4. — Comparison of autosomal and X-linked reproductive tissue-specific and nonreproductive tissue-specific genes. Panel (A) displays results 
for the melanogaster lineage. Panel (B) displays results for the obscura lineage. Numbers of genes per group are given in parentheses. "M" indicates 
male-biased genes, "F" indicates female-biased genes, and "U" indicates unbiased genes. "Rep" indicates genes with expression limited to male (or 
female) reproductive tissues, "Other" indicates the remaining male-biased (or female-biased) genes, and "TissueSpec" indicates genes with expression 
in only a single somatic tissue. Significance levels for relevant comparisons are given in table 3. 



Material online). The correlation between M/F and d N /d s 
was significant for both lineages (melanogaster. p = 
0.12, P < 0.001; obscura: p = 0.14, P < 0.001, see also 
supplementary table 6, Supplementary Material online). 

In addition, we looked at orthologous genes that showed 
conservation of chromosomal location and those that 
showed conserved expression state between the two line- 
ages but not necessarily conservation of chromosomal loca- 
tion. Again, the positive correlations were significant for 
both lineages and were particularly pronounced for male- 
biased genes in the melanogaster lineage (supplementary 
table 6, Supplementary Material online). This indicates that 
there are differences in sex-biased gene evolution between 
lineages. We performed group-based comparisons for both 



the sets of all orthologs between the two lineages and for all 
orthologs with conserved expression state (MM, FF, and UU, 
respectively). We ordered the genes according to their M/F 
expression ratios and divided the sets into five equally sized 
groups. For both data sets, the observed pattern is consis- 
tent with what we found before for all genes in the two lin- 
eages (supplementary table 7, Supplementary Material 
online). The rate of evolution (d N /d s ) differed significantly 
among the expression groups (Kruskal-Wallis test, P < 
0.001 in all cases) and there was a general pattern of faster 
evolution for genes with a higher degree of male-biased ex- 
pression but not for genes with a higher degree of female- 
biased expression (supplementary table 7, Supplementary 
Material online). 
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Table 3 

Significance Levels for Comparisons of (Vds between Autosomal and 
X-Linked Reproductive Tissue-Specific and Nonreproductive Tissue- 
Specific Genes 



Lineage 


Expression 3 


Comparison 


pb 
r 


Melanogaster 


M-Rep 


Auto versus X 


<0.0001* 




F-Rep 


Auto versus X 


0.9904 




U-TissueSpec 


Auto versus X 


0.0380 




M-Other 


Auto versus X 


0.0201 




F-Other 


Auto versus X 


0.0527 


Obscura 


M-Rep 


Auto versus XL 


0.0001* 






Auto versus XR 


0.8055 






XL versus XR 


0.0028* 




M-Other 


Auto versus XL 


<0.0001* 






Auto versus XR 


0.0256 






XL versus XR 


0.0304 




U-TissueSpec 


Auto versus XL 


0.4508 






Auto versus XR 


0.0106 






XL versus XR 


0.1501 




F-Rep 


Auto versus XL 


0.5156 






Auto versus XR 


0.2353 






XL versus XR 


0.6717 




F-Other 


Auto versus XL 


<0.0001* 






Auto versus XR 


0.0495 






XL versus XR 


0.0275 



a "M" indicates male-biased genes, "F" indicates female-biased genes, and "U" 
indicates unbiased genes. M-Rep/F-Rep = reproductive tissue-specific male- or female- 
biased genes, M-Other/F-Other = all other male- or female-biased genes, U-TissueSpec 
= tissue-specific genes, but only expressed in nonreproductive tissues. 

b As determined by Mann-Whitney U tests. Values marked with asterisks are 
significant after Bonferroni correction. 



Discussion 

By investigating the molecular evolution of sex-biased genes 
in two independent distantly related Drosophilid lineages, 
we have been able to uncover some common features of 
sex-biased gene evolution, as well as features that differ be- 
tween lineages. Consistent with a previous study of the mel- 
anogaster lineage (Meisel 201 1), we find that the rate of 
protein evolution (d N /d s ) is positively correlated with the ra- 
tio of male-to-female gene expression (M/F). This correlation 
is mainly driven by male-biased genes, which show the 
strongest correlation between d N /d 5 and M/F in both line- 
ages (table 1). Furthermore, highly male-biased genes 
and those expressed specifically in reproductive tissues show 
significantly higher d N /d 5 than all other categories of genes 
in both lineages (figs. 1 and 2). We also find that ortholo- 
gous genes that show conserved male-biased expression in 
both lineages show the fastest rates of protein evolution 
(fig. 6). 

For female-biased genes there is a slightly negative, but 
nonsignificant, correlation between d N /d s and M/F in the 
melanogaster lineage (table 1). In other words, genes with 
highly female-biased expression tend to evolve faster than 
those with weakly female-biased expression. This agrees 
qualitatively with Meisel (2011) and our categorical analysis 
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Fig. 5. — Correlation of M/F expression ratios and evolutionary 
rates between lineages. Orthologous genes between the melanogaster 
and the obscura lineage were compared. Spearman rank correlations 
were used to determine correlations between M/F expression ratios and 
evolutionary rates for both lineages. Panel (A) displays the relationship 
between M/F expression ratios (p = 0.76, P < 0.001). Panel {B) displays 
the relationship between evolutionary rates measured by d/v/d 5 (p = 
0.37, P < 0.001). For clarity, 22 points lying outside the boundaries of 
the x and y axes (cVd s > 3.0) are not shown. 



of sex-biased genes (fig. *\A). The negative correlation be- 
tween d N /d 5 and M/F is much stronger when we consider 
only genes that are expressed exclusively in female repro- 
ductive tissues (p = -0.11, P = 0.008) and these genes 
show a stronger female-bias in expression than those that 
are expressed in other tissues (median log 2 (M/F) values of 
-1.25 and -0.66, respectively; Mann-Whitney U test, 
P < 0.001). However, in the obscura lineage, there is a pos- 
itive correlation between d N /d s and M/F for female-biased 
genes (table 1 ), suggesting that highly female-biased genes 
evolve more slowly than weakly female-biased genes in this 
lineage. 

Our findings can explain some previous observations 
about sex-biased gene evolution in the obscura and ananas- 
sae groups. For example, Grath et al. (2009) found that 
a sample of genes with male-biased expression in D. ananas- 
sae did not show accelerated rates of protein evolution in 
the ananassae lineage, although those that showed con- 
served male-biased expression between D. ananassae and 
D. melanogaster did. The lack of an observed accelerated 
rate of evolution for D. ananassae male-biased genes 
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Fig. 6. — Comparison of orthologous genes among D. melanogaster, D. ananassae, and D. pseudoobscura. Panels (A-Q display results using d/y/ds 
between D. melanogaster and D. simulans. Panels (D-F) display results using d/y/ds values between D. pseudoobscura and D. persimilis. "M" indicates 
male-biased genes, "F" indicates female-biased genes, and "U" indicates unbiased genes. Panels (A) and (D) show a comparison of the genes with 
conserved expression states. The first letter indicates the expression state in D. melanogaster, the second letter indicates the expression state in D. 
ananassae, and the third letter indicates the expression state in D. pseudoobscura. There were significant differences among the groups in both lineages 
(Kruskal-Wallis test, P < 0.001) and all pairwise comparisons between groups revealed significant differences (Mann-Whitney (J test, P < 0.001 in all 
cases). Panels (B) and (£) show results for genes that have recently gained male-biased expression (UMU, MUU) compared with genes that have recently 
lost male-biased expression (UMM, MUM). Genes that recently lost male-biased expression showed higher rates of evolution in both comparisons (P < 
0.001 for melanogaster, P = 0.06 for obscura). Panels (O and (F) display the equivalent results for female-biased genes. There are no significant 
differences between the groups (P = 0.34 for melanogaster, P = 0.30 for obscura). 



may be attributable to the fact that the examined genes 
were not chosen because they showed strong male-biased 
expression in D. ananassae, but instead were chosen be- 
cause they were highly male-biased in D. melanogaster 
(Proschel et al. 2006; Baines et al. 2008). Because the degree 
of sex-biased expression is not strongly correlated between 
lineages, the male-biased genes that were examined were 
not those with very high levels of male-biased expression 
D. ananassae (Grath et al. 2009). Since it is the highly 



male-biased genes that show the fastest rate of evolution, 
their absence from the D. ananassae data set may explain 
why an increased rate of molecular evolution was not 
detected on the ananassae lineage. 

An early study of sex-biased gene expression in D. pseu- 
doobscura found that genes with conserved male-biased ex- 
pression between D. pseudoobscura and D. melanogaster 
showed accelerated rates of protein evolution, but those 
with male-biased expression only in D. pseudoobscura did 
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Fig. 7. — Comparison of genes with conserved chromosomal 
location and genes located on 3L-XR. Panel (A) displays results using 
d/v/ds between D. melanogaster and D. simulans. Panel (B) displays 
results using d N /d s values between D. pseudoobscura and D. persimilis. 
"Auto-auto" indicates genes with conserved autosomal location, "3L- 
XR" indicates genes located on 3L and XR, respectively, and "X-XL" 
indicates genes with conserved X-linkage. There were significant 
differences among the groups in both lineages (Kruskal-Wallis test, 
P < 0.001). For all pairwise tests, Bonferroni correction has been 
applied. For the melanogaster lineage, genes with conserved X-linkage 
show significantly higher d/v/ds than genes located on 3L-XR (Mann- 
Whitney U test, P < 0.001). There are no significant differences 
between auto-auto and 3L-XR or between auto-auto and X-XL (P = 
0.02 in both cases) (Panel A). For the obscura lineage, the differences 
were significant for the two comparisons involving conserved autosomal 
genes (auto-auto vs. 3L-XR: P = 0.0013; auto-auto vs. X-XL: P < 0.001). 
There is no significant difference between genes with conserved 
X-linkage and genes located on 3L-XR (P = 0.02) (Panel B). 

not (Metta et al. 2006). In this case, the expression was de- 
termined by SAGE (serial analysis of gene expression) in D. 
pseudoobscura and, given the limited depth of SAGE se- 
quencing, would be expected to identify genes with highly 
male-biased expression. However, because the correlation 
between M/F and d N /d 5 for male-biased genes is weaker 
in the obscura lineage than the melanogaster lineage (table 
1), it may be that the limited sample size of Metta et al. 
(2006) prevented the detection of a difference in molecular 
evolutionary rate among groups of genes. Consistent with 
this, a recent study using whole-genome microarrays and 
comparative genomics of D. pseudoobscura and D. persimi- 
lis detected a significantly elevated rate of protein evolution 
for male-biased genes (Jiang and Machado 2009). 



It is likely that estimates of d N /d 5 between the closely re- 
lated D. pseudoobscura and D. persimilis are inflated by the 
presence of shared ancestral polymorphism (Machado et al. 
2002; Machado et al. 2007; Noor et al. 2007; Kulathinal 
et al. 2009). However, even if it inflates overall divergence, 
the presence of ancestral polymorphism is unlikely to affect 
our conclusions. This is because our analyses were performed 
on groups of genes (male-, female-, and unbiased) compared 
between the same two species. For the elevated d N /d 5 of male- 
biased genes to be explained by shared ancestral polymor- 
phism, there would have to be more shared polymorphism 
in male-biased genes than in the other groups of genes. This 
would require either an overall elevation in nonsynonymous 
polymorphism in male-biased genes, or balancing selection 
maintaining more ancestral nonsynonymous polymorphism 
in male-biased genes. Neither of these possibilities is supported 
by observations in the melanogaster or ananassae lineages, 
where divergence has been calculated between more distantly 
related species (Proschel et al. 2006; Baines et al. 2008; Grath 
et al. 2009), although they cannot be ruled out in the obscura 
lineage with the data presently at hand. 

Another common pattern that we observed is that X- 
linked genes showed elevated d N /d s relative to autosomal 
genes (fig. 3 and table 2). This is consistent with previous 
findings in the melanogaster lineage (Baines et al. 2008) 
and extends the observation to the obscura lineage. A pre- 
vious study did not find evidence for faster-X evolution in 
these two lineages (Thornton et al. 2006). However, that 
study had several limitations: 1) only sex-biased expression 
data from D. melanogaster were used, 2) a complete ge- 
nome sequence from a second species of the obscura line- 
age was not available, and 3) only genes that could be 
identified as conserved orthologs between D. melanogaster 
and D. pseudoobscura were analyzed. All of these factors, 
but especially the last one, contribute to the inability to de- 
tect a significant fast-X effect (Baines et al. 2008). 

In principle, faster-X evolution could result from genetic 
drift having a greater impact on the X chromosome than on 
the autosomes. This would be expected if the effective pop- 
ulation size of the X chromosome was smaller than that of 
the autosomes, allowing a higher proportion of neutral (or 
slightly deleterious) mutations to become fixed on the X 
chromosome (Vicoso and Charlesworth 2009; Mank et al. 
2010). However, in Drosophila there appears to be little dif- 
ference in the effective population sizes of the X chromo- 
some and the autosomes, except in recently derived 
populations (Andolfatto 2001; Kauer et al. 2002; Hutter 
et al. 2007; Parsch et al. 2009). Furthermore, there is abun- 
dant evidence across multiple Drosophila lineages that pos- 
itive selection is the predominant force driving protein 
divergence between species (Sella et al. 2009; Wilson 
et al. 201 1) and the influence of positive selection appears 
to be greater on the X chromosome than on the autosomes 
(Baines et al. 2008; Muller et al. 2012). 
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In both the melanogaster and obscura lineages, the 
fast-X effect was strongest for male-biased genes (fig. 3 
and table 2). If faster-X evolution is mainly driven by an 
increased rate of adaptive substitution on the X chromo- 
some due to the efficient selection of recessive mutations 
in hemizygous males (Charlesworth et al. 1987), then this 
pattern should be expected for two reasons. First, male- 
biased genes tend to show the highest rates of adaptive 
evolution between species (Proschel et al. 2006), indicating 
that a larger fraction of amino acid replacements in 
male-biased genes could be potential targets of positive 
selection. Second, male-biased genes are expected to expe- 
rience selection mainly in a male (hemizygous) genetic back- 
ground where recessive X-linked mutations are immediately 
exposed to selection. Given the strong correlation between 
M/F and 6 N /6 5 observed for male-biased genes, a fast-X 
effect could also be caused by X-linked genes having 
stronger male-biased expression than autosomal genes. 
However, this is not the case, as autosomal male-biased 
genes show higher M/F than X-linked male-biased genes 
on both lineages (Mann-Whitney U test, P < 0.05 in both 
lineages; see also supplementary table 8, Supplementary 
Material online). 

Unbiased genes typically will be subject to selection in 
both sexes and thus, will sometimes encounter selection 
in a hemizygous background. This may explain why unbi- 
ased genes show a fast-X effect that is smaller than that 
of male-biased genes, but still significant, in both the mel- 
anogaster and obscura lineages (table 2). Female-biased 
genes, in contrast, should mainly encounter selection in 
the female genetic background, where X-linked recessive 
mutations have no fixation advantage over autosomal reces- 
sive mutations. This may explain why a fast-X effect is not 
observed for female-biased genes on the melanogaster lin- 
eage (table 2). However, female-biased genes do show 
a significant fast-X effect in the obscura lineage (table 2). 
We investigated if this could be a result of X-linked or 
female-biased genes having lower rates of synonymous di- 
vergence (d 5 ). Whereas genes located on XL have higher d 5 
than genes located on XR or on the autosomes (see below), 
there is indeed lower synonymous divergence for female- 
biased genes in comparison to both male- and unbiased 
(Mann-Whitney U test, P value < 0.001 in both cases). This 
relative reduction in d s may result from there being greater 
constraint on synonymous codon usage for female-biased 
genes (Hambuch and Parsch 2005; Singh et al. 2005), pos- 
sibly because they show greater expression breadth than 
male-biased or unbiased genes (Meisel 2011). However, dif- 
ferences in d 5 cannot explain the observed fast-X effect, as 
X-linked female-biased genes have higher d 5 than autoso- 
mal female-biased genes (Mann-Whitney U test, P < 0.001 ). 

In the obscura lineage, the fast-X effect is strongest for 
comparisons between the autosomes and XL (the ancestral 
X chromosome) but is also evident in comparisons between 



the autosomes and XR (the neo-X chromosome; table 2). 
This cannot be explained by a reduction in d 5 on chromo- 
some arm XL, as d 5 is significantly higher for XL than for 
XR and all autosomes (Kruskal-Wallis test, P < 0.001, pair- 
wise Mann-Whitney U test, P< 0.001 in all cases). Instead, 
the large fast-X effect observed for XL is caused by elevated 
nonsynonymous divergence. This is consistent with there 
being a long-term pattern of faster-X evolution and not 
a brief burst of accelerated evolution following the shift 
from autosomal to X-linkage. 

In summary, the combination of transcriptomic and com- 
parative genomic data has allowed us to investigate patterns 
of sex-biased gene evolution across Drosophila lineages that 
diverged up to 50 Ma. In both the melanogaster and ob- 
scura lineages, we observed an accelerated rate of protein 
evolution for male-biased genes, especially those expressed 
in reproductive tissues, and a positive correlation between 
the degree of male-biased expression and d N /d 5 . For 
male-biased genes, this correlation is stronger in the mela- 
nogaster lineage than in the obscura lineage. The fastest 
evolving genes are those that show conserved male-biased 
expression between lineages. These findings can explain 
some differences between lineages observed in studies that 
used smaller data sets. The separation of genes into sex- 
biased expression groups also reveals a fast-X effect that 
is particularly pronounced for male-biased genes, as would 
be expected if positive selection acts on recessive X-linked 
mutations in hemizygous males. In the pseudoobscura 
lineage, a fast-X effect is also observed for male-biased 
genes on the neo-X chromosome, although the effect is 
smaller than that for genes on the ancestral X chromosome. 
This suggests that long-term X-linkage promotes rapid 
adaptive evolution of genes expressed predominantly in 
males. 



Supplementary Material 

Supplementary tables S1-S8 are available at Genome 
Biology and Evolution online (http://www.gbe. oxfordjournals. 
org/). 
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